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I .   INTRODUCTION 

The  present  paper  treats  the  problem  of  estimating  price-quantity 
relationships  which  vary  over  crosE  sectional  units  and  over  time.  The 
units  chosen  are  the  fifteen  largest  brands  in  the  margarine  market,  and 
the  time  periods  are  weeks.  The  data  cover  one  year  (1965/66)  and  are 
consumer  panel  observations  (they  are  MRCA  data)  aggregated  over  individual 
consumers'  purchases.  Although  the  analysis  in  this  paper  concerns 
brands  over  time,  it  is  straightforward  to  treat  the  question  of  over 
time  variations  for  individual  panel  members,  for  example. 

The  price-quantity  relationship  has'  a  great  Interest  in  Itself,  not 
only  for  economists  but  also  for  producers,  marketers,  and  retailers. 
The  emphasis  in  this  paper,  however,  is  upon  the  estimation  techniques 
Involved.  The  reason  Is  simply  that  it  is  there  I  see  the  paper's  main 
contribution;  it  is  also  where  I  have  done  most  of  the  work  for  the 
paper.  There  are,  however,  substantive  results  from  the  research  and 
I  will  spend  some  time  on  those  towards  the  end. 

That  different  brands  would  show  different  sensitivity  to  price 
can  reasonably  be  argued  on  an  a  priori  basis  —  some  brands  simply  use 
price  as  an  important  ingredient  in  the  marketing  mix,  others  do  not. 
Later  on  we  will  see  empirical  evidence  supporting  this  argument.   In 
addition,  it  can  be  argued  that  over  time  a  brand's  sensitivity  might 
undergo  changes.   For  example,  much  like  the  "wheel  of  retailing" 
phenomenon,  the  low-price  private  brands  sometimes  undergo  a  quality 
improvement  with  concomitant  price  rises.  The  higher  priced  name  brands 
might  then  be  forced  to  compete  more  on  a  price  basis  than  before  due  to 
this  competitive  pressure  which  has  affected  the  price  sensitivity  of  the 
brands.  Again,  empirical  evidence  on  these  changes  will  be  discussed  below. 
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As  a  consequence  of  these  possible  variations,  managers  as  well 
as  analysts  are  faced  with  relationships  between  prices  and  quantities 
which  mighc  be  different  for  dif f e. ant  brands  as  well  as  different  over 
time.  This  situation  has  in  the  past  been  very  difficult  to  handle,  since 
it  implies  that  one  cannot  observe  the  same  response  phenomenon  more 
than  once.  How  can  statistical  techniques,  or,  for  that  matter,  managerial 
experience  be  brought  to  bear  on  situations  which  may  be  unique  to  each 
brand  and  different  at  each  point  in  time?  It  is  this  question  we  will 
develop  a  possible  answer  to  in  what  follows. 

II.   THE  PRICE-QUANTITY  RELATIONSHIP 
Basic  Model 

According  to  standard  economic  micro  theory  price  ought  to  be  an 
important  determinant  of  the  quantity  purchased  of  a  product  such  as 
margarine.  Substitutes  exist,  especially  butter,  of  course,  and  within 
the  product  group  there  are  several  competing  brands.   In  addition,  no 
evidence  is  available  to  suggest  that  margarine  is  an  inferior  product, 
even  if  there  has  been  some  indica'  Ions  in  the  daily  nress  that  some 
people  change  from  butter  to  margarine  in  "hard  times".  All  in  all,  one 
appropriate  model  might  be  the  one  depicted  as  (1)  in  Exhibit  1. 

Here  q  ^   =»  the  quantity  (in  lbs.)  bought  of  brand  n  at  time  t; 

^nt  "^  ^^^   price  (in  dollars)  of  brand  n  at  time  t; 

^  t  -   the  mean  price  in  the  market  at  time  t; 

Pgt  =  the  market  price  index  for  butter  at  time  t. 
The  a^,  i=»0,...,  3  are  unknoT^  coefficients,  the  last  three  of  which  would 
hypothetically  equal  or  be  less  than  zero.  The  e^^^  error  term  picks  up 
influences  from  other  variables  not  explicitly  considered  in  the  equation. 
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To  the  extent  that  the  left-out  variables  interact  or  are  strongly 
correlated  with  any  of  the  price  variables,  the  model  is  clearly  too 
simpleminded.  The  price  coefficients  will  in  such  cases  be  biased  and 
exhibit  spurious  correlation.  Nevertheless,  since  panel  data  such  as 
these  are  very  common  and  do  not  include  much  beside  price  effects,  it 
will  be  of  interest  to  see  if  anything  about  these  effects  can  be 
learned  from  the  analysis. 

These  panel  data  do  contain  information  on  deal  purch&aes,  however. 
It  is  likely  that  for  a  product  such  as  margarine  the  typical  interactions 
between  price  and  other  competitive  means  (advertising  and  in-store 
promotions,  for  example)  occur  when  some  type  of  price-cutting  deal  on 
a  brand  is  offered.   By  eliminating  from  the  analysis  those  individuals 
who  bought  on  a  deal  basis  we  will  be  able  to  eliminate  at  least  oxie 
sizable  source  of  possible  spuriousness. 

Since  the  number  of  panel  members  buying  margarine  will  vary  from 
week  to  week,  it  is  also  necessary  to  standardize  the  quantity  variable 
by  the  number  of  people  buying.  This  means,  of  course,  that  the  analysis 
does  not  consider  those  possible  price  effects  which  make  more  people 
buy  in  one  week  rather  than  another. 

As  a  result  of  these  procedural  steps,  the  analysis  to  follow  will 
evaluate  the  model  with  the  dependent  variable  being  the  mean  quantity 
bought  by  those  individuals  who  did  not  buy  on  a  deal  basis.   The  price 
variable  P  ^  is  computed  as  the  mean  price  per  pound  paid  by  these 
individuals.   The  mean  is  calculated  on  a  weighted  basis,  so  that  a  large 
purchase  at  a  low  price  is  counted  heavier  than  a  small  purchase  at  a 
high  price. 
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Slnce  the  early  tuns  of  the  model  revealed  high  serial  correlation 
and  multicollinearity  —  both  of  which  could  be  expected  —  some 
transformations  were  made  before  the  final  runs.   First,  the  average 
price  in  the  margarine  market  was  recomputed  so  as  to  e^'.clude  the  particular 
brand  being  analyzed.  Than  under  the  assumption  of  no  reaction  to  a 
moderate  price  change  by  "our"  brand  under  analysis ,  a  first  difference 
version  of  (1)  was  developed.   It  is  depicted  as  equation  (2)  of 
Exhibit  1. 

Since  there  was  still  considerable  multicollinearity  between  the 
simple  price  difference  and  the  butter  price  variable,  the  decision  was 
made  to  drop  the  latter  variable  from  the  analysis.   Attempts  to  fit 
non-linear  forms,  including  a  quadratic,  produced  no  particular 
improvements  in  either  fit  or  significance  of  coefficients.  The  final 
form  of  the  model  used  for  the  estimation  became  that  of  equation  (3) 
in  Exhibit  1. 

The  constant  a  was  introduced  as  a  scale  factor.  The  asterisks 
indicate  simply  that  these  coefficients  are  not  the  same  as  those  in 
(1)  and  (2)  because  of  the  elimination  of  the  butter  price  term. 
Are  the  Differences  Significant? 

In  the  introduction  we  claimed  that  there  are  a  priori  reasons  for 
allowing  the  price-quantity  relationship  to  vary  across  brands  and  over  time 


^The  butter  price  index  is  published  monthly  by  the  Department  of 
Agriculture.  The  elimination  of  the  butter  variable  resolved  also  the 
question  of  what  to  do  about  within  month  variations  which  were  not 
recorded. 
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"faced  with  the  data,  the  first  question  one  naturally  asks  is  whether 
these  differences  are  significant  for  this  particular  sample.  To  find 
out,  I  used  the  tests  developed  by  Fisher  and  also  used  by  Bass  & 
Wittink  in  their  paper  presented  here. 2 

For  the  texts  several  regression  runs  were  necessary.   First,  an 
overall  run  with  all  brands  and  all  time  periods  (51  weeks)  pooled  was 
made.   This  provided  one  standard  of  comparison,,  where  slope  coefficients 
as  well  as  intercept  were  supposed  to  be  the  same  for  all  brands.   A 
""cond  run  allowed  fifteen  separate  intercepts  but  held  the  two  price 
coefficients  fixed.   Thirds  completely  separate  regressions  were  made, 
one  for  each  brand,  allowing  all  the  parameters  to  be  different  across 
brands. 

The  results  from  these  runs  are  presented  in  Tables  1  and  2.   The 

appropriate  F-statistic  to  test  whether  the  brands'  coefficients 

differ  significantly  from  the  overall  regression  was  equal  to  5.62  for 

the  pooled  run  with  only  one  common  intercept.   The  critical  value  of 

^.05  762,720  ~  1.16,  and  thus  the  differences  are  very  significant. 

Allowing  the  intercepts  but  not  the  slopes  to  vary,  the  F  was  computed 

z:.   equal  to  3.85  —  the  critical  F  -.^  ^.^   -jo.a  is  about  the  same  as 

.05,748,720 

before  —  and  the  differences  are  again  significant. 


^Fisher,  F.M. ,  "Tests  of  Equality  Between  Sets  of  Coefficients  in  IVo 
Linear  Regressions:  An  Expository  Note,"  Econometrica,  Vol.  38,  No. 2 
(March,  1970),  pp.  361-67,  and  Bass,  F.M.  &  D.  Wittink,  "Pooling  Issues 
and  Methods  in  Regression  Analysis  with  Examples  in  Marketing  Research," 
Working  Paper,  Krannert  Graduate  School  of  Industrial  Administration, 
1974. 
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To  complete  the  testing  of  across  brand  differences  one  should 
actually  run  pooled  regressions  where  one  but  not  the  other  of  the 
two  price  coefficients  is  allowed  t  •>   vary.   I  did  not  do  this  —  it 
Involves  running  a  set  of  15  times  2  regressions  —  since  judging  from 
the  slope  estimates  in  the  tables  it  seems  quite  clear  that  one  can 
assiime  the  differences  to  be  significant  again. 

Turning  to  over-tiiae  differences  ^  the  analysis  was  rerun  with 
separate  regressions  for  each  of  the  51  tirse  periods ,  using  15  brands 
as  the  data  points.  The  tests  used  the  pooled  versions  as  before,  one 
with  51  different  intercepts.  As  before^  the  results  indicated 
significant  differences  in  the  brands'  price  coefficients  at, the  .C5  level. 

III.   A  STOCHASTIC  PAEAMETER  MODEL 
The  Model 

At  this  stage  one  could  have  terminated  the  analysis  and  attempted 
to  interpret  the  findings,  or  one  could  continue  the  analysis  with  mora 
flexible  pooling  models.  Since  the  first  option  was  undesirable  considering 
some  of  the  estimates  in  the  tables,  the  second  line  x^aa   pursued.   The 
next  type  of  model  one  would  usually  attempt  here  is  the  error  components 
structure,  perhaps  succeeded  by  a  random  coefficient  model.   Instead,  I 
went  directly  into  what  is  sometimes  called  stochastic  parameter  models. 
There  were  two  reasons  for  this.  One,  the  rariations  exhibited  by  the 
regression  coefficients  made  it  unlikely  that  one  or  two  components  of 
the  composite  equation  error  would  be  enough  to  handle  the  diversity. 
Two,  there  was  no  random  coefficient  program  available  at  the  U.  of  I. 
and  since  the  algorithm  I  had  available  for  the  stochastic  parameter  case 
included  a  version  of  random  parameters  as  a  special  case  there  was  no 
reason  to  hesitate. 


-7- 

One  might  wonder  whether  in  fact  the  topic  of  possible  pooling 
should  be  pursued  at  all  with  these  data  considering  the  results  so  far. 
There  are  vo   clues  as  to  what  migh*  make  for  homoganeous  subgroups,  fo 
instance.   Usually  the  distinction  between  private  and  national  brands 
(the  private  brands  here  are  A&P's  Ann  Page  and  Nutley,  and  Kroger *8 
Eatmore)  would  be  expected  to  manifest  itself  in  differing  price  sensitivity, 
but  not  so  here.   In  fact»  if  the  interest  focuses  mainly  on  one  brand 
the  analysis  should  probably  go  on  for  that  brand  alone.   If  the  interest 
lies  more  in  a  general  characterisation  of  the  price  sensitivity  in 
this  market  as  a   whole,  however »  we  would  still  like  to  pursue  possible 
coimnunalitles  between  the  brands. 

The  perhaps  most  general  form  of  the  stochastic  parameter  regression 
model  is  probably  the  convergent  regression  structure  due  to  Rosenberg. ^ 
The  basic  form  of  the  model  is  shown  in  Exhibit  2. 

Equation  (4)  is  the  main  model  expressing  the  dependent  variable 
as  a  function  of  two  types  of  independent  variables.   One  is  the  w-  j^, 
i*»l,...,K,  which  comprise  coefficients  fixed  for  cross  sectional  units. 
The  other  type  is  the  Zj.^^,   j-l,,.  ..^L,  where  different,  cross  sectional 
units  are  allowed  differing  coefficients.  The  5^  is  the  Kronecker  delta 
equal  to  one  for  s~t,  zero  otherwise.  The  disturbance  in  the  equation, 
xijxt*   ^*®  ^  mean  of  zero  and  no  serial  correlation.  There  is  assumed  to 


-^See,  in  particular,  Rosenberg^  B. ,  "The  Analysis  of  a  Cross  Section  of 
Time  Series  by  Stochastically  Convergent  Parameter  Regression,"  Annals 
of  Economic  and  Social  Measurement,  Vol, 2,  No. 4  (October,  1973),  pp. 
399-427.  This  issue  focuses  on  variable  parameter  regression  models. 
The  discussion  of  the  convergent  parameter  model  which  follows  here 
draws  directly  upon  the  Rosenberg  article. 
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be  no  correlations  between  individual  units'  disturbances  in  any  one 
time  period,  except  for  the  influence  of  a  common  variance  component,  R^. 
In  terms  of  our  price-quantity  relationship,  brands  are  similar  to  the 
extent  one  assigns  the  price  coefficients  as  cross  fixed  and  makes  the 
individual  error  variances  R^^  small  relative  to  Rq.      Conversely,  if  we 
denote  both  price  coefficients  as  cross  varying  and  let  the  Rjj's  get 
large  relative  to  Rq  the  more  individualized  to  the  particular  brand 
unit  will  the  analysis  become.   In  the  present  case  it  seems  clear  that 
the  latter  course  is  desirable.  As  we  will  see  later,  these  types  of 
prior  specifications  become  important  so  as  to  get  a  good  starting  point 
for  the  algorithm,  '  " 

The  second  part  of  the  structure  specifies  exactly  how  the  cross 
fixed  and  the  cross  varying  parameters  behave  over  time,  and  how  the 
latter  differ  between  cross  sectional  units.  These  so-called  transition 
relations  are  shown  as  (5)  and  (6)  in  Exhibit  2. 

The  connotation  of  a  convergent  parameter  structure  derives  from 
the  behavior  of  the  a^j^j-'s  which  tend  to  converge  to  the  mean  a^.  in 
proportion  to  (I  -  A,).   The  basic  argument  behind  this  structure  is  that 
in  many  economic  applications  differences  between  units  tend  to  be 
obliterated  over  time  as  firms,  or  brands,  or  individuals  tend  to  some 
common  "norm"  —  this  norm  being  estimated  by  a^.  At  the  same  time,  the 
random  shock  term  represented  by  r\  .    indicates  the  presence  of  competitive 
and'  other  forces  which  tend  to  counteract  the  movement  towards  the  norm  by 
differentiation  of  each  unit  away  from  the  mean.^ 


'^Strictly  speaking,  the  only  requirement  on  the  cross  varying  parameter  process 
(as  well  as  the  cross  fixed  process)  is  that  it  generates  a  stationary  process 
Without  sfcationarity  one  would  never  be  able  to  observe  the  same  phenomenon 
over  time  and  samples  greater  than  one  would  be  impossible  to  get. 


-9- 

In  our  application,  a  priori  expectations  would  be  that  price 
sensitivity  would  be  homogeneous  among  some  brands  (the  private,  say). 
As  the  estimates  have  indicated,  hov/ever,  the  norms  do  not  tend  to  be 
very  strong  in  this  market  and  differentiation  seems  much  more  powerful. 
Accordingly,  we  are  probably  faced  with  a  situation  where  the  divergence 
rates  in  Aa  are  fairly  close  to  one ,  and  where  the  brand  specific 
variances  Q^  are  much  greater  than  the  general  variance  component  Qq. 

The  cross  fixed  parameters  follow  a  simple  autoregressive  process, 
with  the  expectation  that  they  stay  where  they  were  last  period.   None 
of  the  disturbances  in  the  parameter  transitions  are  correlated  with 
the  disturbance  in  the  equation  (4).   Similarly,  there  is  no  serial 
correlation  in  any  of  the  disturbances.   Both  of  these  properties  are 
necessary  for  the  decomposition  of  residual  error  that  is  followed  in 
the  estimation  stage  (see  below) .   It  should  be  noted  that  contemporaneous 
correlations  between  the  parameter  shift  disturbances  are  allowed.   Thus, 
a  major  external  shock  to  the  system  might  shift  both  the  cross  fixed 
and  the  cross  varying  parameters  around. 

The  general  structure  encompassed  by  (4)  through  (6)  comprises  a 
few  interesting  special  cases.   If  the  divergence  rates  Ax   are  set  to 
zero,  and  there  are  no  cross  fixed  parameters,  the  resulting  structure 
represents  one  version  of  a  random  coefficient  model.   The  coefficient 
vector  for  any  one  time  period  consists  of  the  mean  values  a^  and  the 
variances  of  rinf   Similarly,  a  type  of  an  error  component  regression 
model  might  be  developed  by  making  all  slope  coefficients  cross  fixed, 
allowing  the  R^^,  n»l,...,N,  to  pick  up  the  Individual  effects.  The  only 
problem  is  that  the  time  effect  is  assigned  to  the  cross  fixed  parameter 
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through  Y*.  3^^^   f^ot  the  equation  itself,  x^hich  means  that  the  parameter 
estimates  will  vary  over  time.   Thus,  it  does  not  seem  that  the  usual  form 
of  an  error  component  model  —  where  the  coefficients  are  assutaed  fixed 
over  all  units  and  time  periods  —  becomes  a  proper  special  case  of  the 
convergent  parameter  regression  model. 
Es t iipat ion  Pro cedu re 

The  estii.aation  of  the  parameters  in  the  convergent  structxire  is 
thoroughly  developed  by  Rosenberg  and  we  need  only  cover  it  very  briefly 
here.   Basically,  the  first-order  Markovian  property  of  the  parameter 
transitions  assumed  for  the  cross  varying  and  the  cross  fixed  coefficients, 
together  with  the  serial  independence  in  the  errors ^  imply  that  the  so- 
called  Kalraan  filter  well  knot^?n  frota  control  theory  can  be  applied. 
This  leads  to  a  recursive  approach  for  the  estimation,  using  the  fact  that 
one  only  needs  to  know  the  preceding  period's  parameter  values  in  order  to 
derive  predictions  on  the  present  period's  values. 

In  Exhibit  3  the  basic  steps  involved  in  the  estimation  procedure  are 
Indicated.   In  order  to  understand  the  operations,  the  following  identifica- 
tions are  necessary. 

^tlt-1  ^^  ^^'^  tnatrisc  of  parameter  estimates  at  t  based  upon  data  through 

$      is  the  matrix  of  transition  (convergence  and  divergence)  rates. 

Q      is  the  matrix  of  containing  all  the  submatrices  of  Q^,  Q^,  Q^a* 
and  Qq. 


See  Aoki,  M.  ,  Optiinigatlon  of  Stochastic  Systems.  New  York:  Academic 
Press,  1967. 
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R      Is  the  matrix  containing  all  the  submatrices  of  R^  and  Rq. 

Ml,  is  the  estimated  —  on  the  basis  of  the  assigned  Q-values 

and  observed  forecast  error  (see  below)  —  variance-covariance 
matrix  of  the  parameters  in  V*.|*.„i» 

Exhibit  4  gives  the  exact  definitions  of  these  matrices. 

Given  a  specification  of  the  stochastic  terms  in  Q  and  R,  the 
transition  rates  in  *,  and  the  first  period  starting  values  for  y  and  M, 
the  first  stage  of  the  estimation  can  be  carried  out.  Here  an  extrapolation 
takes  place,  shown  in  equations  (7)  and  (8),  which  is  based  upon  the 
processes  specified  in  the  parameter  transition  equations  (5)  and  (6) . 
Thus,  the  next  period's  values  of  the  cross  fixed  coefficients  are 
predicted  to  be  equal  to  this  period's,  whereas  the  cross  varying  parameters 
are  updated  using  the  assigned  divergence  rates  in  A..   Similarly,  the 
variance-covariance  matrix  of  these  coefficients  is  extrapolated  to  the 
next  period  using  the  same  divergence  rates  and  in  addition  the  assigned 
Q-values . 

In  the  second  stage  of  the  estimation  these  extrapolated  parameter 
values  are  used  together  with  the  observed  X-values  to  forecast  the 
value  of  the  dependent  variable  for  the  coming  period.   From  this  forecasted 
value  and  its  difference  from  the  actually  observed  y-value  comes  the  basic 
piece  of  evidence  on  how  well  our  initial  parameters  seem  to  hold  up  (the 
"innovation"  in  control  theory  terminology) .   The  question  is  how  much  of 
this  forecast  residual  error  is  due  to  incorrect  parameter  values,  and 
how  much  is  due  to  the  equation  distrubances.   In  the  Kalman  filtering 
approach  this  is  determined  by  the  total  variance  of  the  forecast  error  on 
the  one  hand,  shown  in  equation  (10),  and  the  covarlances  between  the 
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parainster  disturbances  and  the  forecast  error  on  the  other,  shown  in 
equation  (11).  As  the  revision  functions  in  Exhibit  3  show,  the  error 
is  used  to  apdate  the  parasieter  vec  .ors  according  to  uhe  product  of  the 
covariance  matrix  and  the  inverted  variance  matrix c   This  product,  the 
"gain"  in  control  theory  terminology,  represents  in  fact  the  least 
squares  estimator  of  the  linear  relationship  between  the  paranieter 
misestimation  or  "extrapolation  error"  and  the  forecast  error. '^ 

For  the  recursive  process  to  have  a  starting  point,  the  analyst 
needs  to  input  values  for  the  first  period  <.   These  can  be  Bayssian  priors 
and  the  analysis  can  be  carried  out  for  the  typical  Bayeaian  approach 
deriving  posterior  distributions  for  the  parameters.   Alternatively 5  the 
initial  paraisater  values  can  be  set  at  zero,,  representing  diffuse  priors. 
Rosenberg  shows  that  such  a  choice  of  priors,  together  with  the  specifica- 
tion of  the  stochastic  parameters ,  where  the  Q's  can  be  taken  as  starting 
values  for  Mj,  will  not  impair  the  performance  of  the  algorithm,  or  the 
properties  of  the  estiiaates  derived. 
The  Algorithm 

The  algorithm  used  for  the  estitaation  was  supplied  by  Rosenbergo   He 
has  had  one  version  working  at  Berkeley  for  the  last  tvo  or  so  years,  but 
this  did  not  include  seme  of  the  newer  developments  included  in  the  paper 
cited  above »  He  and  I  have  worked  relatively  closely  on  the  finalizing 
and  testing  of  the  algorithm.   It  is  not  yet  finished 5  although  the  basic 


^Sea  Slhodes,  Ian  B.  ,  "A  Tutorial  Introduction  to  Estimation  and  Filtering/* 
12.EE   Transactions  on  Autoiaatic  Control,  Vol.  AC-16,  No.  6  (December  1971),  v. 6 

^Op.cit. ,  p. 409. 
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recursive  estimation  part  is  completed.   If  there  are  still  errors  in 
what  Is  presented  here  I  would  have  to  take  responsibility  for  them. 

The  input  to  the  algorithm  consists  of  the  stochastic  specification 
and  the  data.   Given  the  stochastic  specifications,  the  recursion  will 
yield  maximum  likelihood  estimates  of  the  coefficients.   Full  maximum 
likelihood  estimation  of  all  the  parameters,  including  the  stochastic 
terms  is  only  possible  through  an  iterated  search  over  the  range  of 
feasible  parameter  values.   At  the  present  the  algorithm  carries  out  the 
iterations  through  simple  DO-loops  —  later  the  whole  recursion  routine 
will  be  set  up  as  a  subroutine  for  a  non-linear  search  code. 

The  initial  values  of  the  stochastic  terms  can  be  chosen  in  different 
ways.   There  exists  no  hard  and  fast  procedure  here,  even  though  Rosenberg 
provides  one  possibility  in  his  article."  Basically,  the  program  expects 
■:'-.2  input  values  to  be  of  relative  magnitudes,  with  the  general  disturbance 
in  the  equation,  Rq,  the  standard  of  comparison.   In  the  application  we 
are  concerned  with  here,  t:he  earlier  results  gave  some  indication  as  to 
the  appropriate  values.   Thus,  without  any  cross  fixed  parameters,  Q  and 
*^ca  ^^^^'^'^   zero,  of  course.   Q^  was  set  as  greater  than  Qq  since  the 
coefficients  showed  considerable  interbrand  differences,  and  since  the 
low  significance  levels  indicated  also  high  intrabrand  variations.   The 
divergence  rates  A .  were  set  high  for  both  the  cross  varying  parameters , 
since  the  influence  from  the  common  norm  seemed  quite  negligible.   The  Qaia? 
covariance  was  set  on  the  basis  of  the  correlations  between  the  two 
Independent  variables. 


^Op.cit.,  p. 412. 
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In  tha  approximation  used  —  the  one  referred  to  as  Al  in  Rosenberg's 
paper  —  the  R  j  n='lj..*5N,  were  set  equal  for  ail  the  brands.   This  is 
the  only  avproxiiaation  tested  out  ^  of  ar  and  is  the  one  recotcEiended  by 
Rosenberg.    For  the  case  at  hand^  however,  the  assumption  of  equal  ^-*s 
seems  less  tenable  than  would  have  been  desirable.   It  is  clear  from  the 
separate  brand  runs   that  the  explanatory  power  of  price  differs  considerably 
between  the  various  brands  analyzed,  as  we  saw  in  Table  2.   For  the 
approximation  used,  hov;evarj  the  only  solution  was  to  set  R„  quite  low  and 
allow  the  common  R  to  be  computed  as  the  residual  variance. 

The  starting  values  for  the  coefficients  in  the  first  period  were  the 
ones  derived  by  Rosenberg  for  the  no-prior  case.   Thus,  initially  the 
parameter  vectors  were  set  at  2;ero,  updated  via  the  divergence  rates  in  Aa^ 
and  then  revised  according  to  their  covariances  with  the  forecast  error 
just  as  shown  above.   Only  after  this  revision  will  the  parameters  be 
non-zero,  incidentally.   The  moment  matrix  M  was  initialized  by  the  Q's. 

Since  this  is  a  new  algorithm  a  few  additional  remarks  might  be  in 
order.   The  code  is  written  in  Fortran  IV,  and  requires  about  260K  of  core 
on  an  IBM  360-75.   For  the  15  brands/51  weeks  case  run  here,  time  for  the 
completed  recursion  for  a  given  stochastic  specification  lies  between  25-30 
seconds  of  CPU-tixae.   (This  was  the  case  for  input  data  cosaing  from  disc 
which  decreases  costs  considerably  when  data  files  are  large.)   The  program 
occupies  about  1800  key  punch  cards,  and  should  also  be  run  from  disc 
wherever  feasible. 


^Op.cit.,  p. 414. 
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IV.  RESULTS 
Only  a  few  of  the  esipirical  results  are  necessary  to  present  here. 
The  paraiaecer  estimates  are  not  sia..:iiiiam  likelihood  Since  not  all  possible 
stochastic  specifications  vjere  run.   However >  within  the  fairly  narrow 
range  of  values  tried  the  ones  presented  in  Tables  3  and  4  were  the  best 
according  to  the  value  of  the  likelihood  function  and  the  coefficient 
signs. 


Since  the  introduction  of  one  cross  fixed  consLant  tena  generated 
all  positive  parameter  estimates,  the  input  data  vjere  standardized  on  the 
basis  of  each  brand's  respective  mean  and  standard  deviation.  After  the 
extraction  of  the  constant  terms,  the  results  are  quite  similar  to  those 
of  the  separate  runs  in  Table  2,  and  accordingly  quite  disappointing. -'•^ 
On  the  other  hand,  the  closeness  of  the  two  sets  of  estimates  tends  to 
generate  some  confidence  in  the  estimation  algorithm.   In  the  cases  where 
there  is  a  change  in  sign  most  of  the  early  estimates  were  not  significantly 
different  from  zeroj  and  thus  a  revers^'.l  of  signs  should  not  be  unexpected. 
I  would  not  like  to  say  that  these  results  represent  the  final  work  on  the 
price  effects  in  the  margarine  market,  however. 

Of  most  interest  seemed  the  variations  due  to  the  divergence  index 
sinte  this  as  a  limiting  case  of  zero  led  to  a  random  coefficient  model. 


■*■  The  results  represent  the  parameter  in  the  51st  week.   Since  the  parameter'' 
change  over  time,  they  will  in  general  be  different  between  periods.  Usir 
all  the  51  weeks  of  data  to  estimate  any  one  period's  parameters  requires 
the  use  of  a  "smoothing"  device  not  yet  in  the  algorithm. 
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As  can  be  seen  from  the  tables,  the  results  for  the  random  coefficient 
models  are  somewhat  inferior  to  those  of  the  convergent  model.   This  can 
be  seen  frum  the  different  values  f  the  likelihood  1 unctions  and  the 
signs  of  the  coefficients.   The  differences  tend  to  support  the  idea  that 
pooling  the  data  in  any  way  other  than  allowing  for  great  differences  — 
as  one  does  in  the  convergent  parameter  model  — -   is  not  acceptable  in 
this  case.   It  is  clear  also,  however,  that  the  convergent  structure 
might  in  fact  impose  too  much  restriction  on  the  parameter  variations  of 
the  various  brands. 

The  particular  form  of  the  random  coefficient  model  is  depicted 
clearly  in  the  coefficients  in  the  tables.   As  can  be  seen,  many  are  of 
exactly  the  same  magnitude,  an  expression  of  the  fact  that  the  random 
model  basically  estimates  a  mean  and  a  variance  for  each  coefficient,  and 
the  cross  sectional  units  do  not  usually  become  identified  with  any  one 
particular  coefficient  estimate.   Thus,  the  coefficient  estimates  are 
somewhat  of  an  artifact  for  this  model. 

Judging  from  the  two  tables,  the  stochastic  specification,  does  not 
seem  to  make  much  of  a  difference.   This  was  not  true  for  all  values  tried, 
however.   For  example,  moving  the  Qo's  down  low  relative  to  the  Qq's  gave 
a  considerably  lower  likelihood  value.   Overall,  however,  it  seems  that 
the  approximation  due  to  similar  R^'s  does  not  seem  very  appropriate, 
and  this  might  affect  the  impact  of  the  other  term  in  the  stochastic 
specification. 

In  terms  of  substantive  findings  for  the  different  brands,  it  seems 
that  no  brand  has  a  consistently  negative  price  impact  from  both  absolute 
price  changes  and  relative  changes.   On  the  other  hand,  some  brands 
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(Nucoa  and  Parkay)  seem  to  have  a  consistently  positive  relationship, 
indicating  that  a  rise  in  prices  would  be  beneficial.   It  is  difficult 
to  evaluate  whether  these  results  are  valid  or  not.   If  data  were 
available  on  the  relative  spending  by  these  brands  for  in-store  promotions, 
packaging,  advertising,  and  other  promctional^ef forts  as  well  as  quality 
differences,  one  would  have  a  much  stronger  basis  on  which  to  assess 
these  findings o   It  is  clear,  however j  that  since  some  of  the  left-out 
variables  might  have  had  only  a  limited  variability  during  the  year 
observed  (quality  tuight  be  such  a  constant  factor)  then  the  use  of 
separate  brand  intercepts  would  account  for  the  differences.  Likewise, 
if  there  were  no  consistent  efforts  to  synchronize  the  regular  pricing 
policies  with  the  other  selling  efforts,  these  impact  coefficients  would 
,  not  necessarily  be  spurious.   Since  the  deal  purchases  were  excluded 

from  the  analysis,  such  an  assumption  ■ —  which  is  tantamount  to  the  statistical 
assumption  that  price  variables  are  uncorrelated  with  the  disturbances  ■ — 
might  xveli  be  warranted. 

If  we  issume  that  the  coeffici  acit  estimates  are  non-spurious,  what 
do  they  imply?  Judging  from  the  tables  —  at  least  Table  2  —  the  significance 
of  the  estimates  is  not  very  high,  so  many  of  the  price  effects  might  well 
be  zero.  On  the  other  hand,  four  out  of  30  coefficients  are  different 
from  zero,  and  since  this  is  somewhat  greater  than  chance  (although  not 
muchi)  one  might  want  to  investigate  what  could  possibly  make  for  positive 
versus  negative  price  sensitivities. 

The  negative  part  is  easy.   Clearly,  if  price  of  margarine  matters 
to  the  buyer,  one  would  expect  she  would  rather  buy  the  cheaper  brand 
rather  than  the  more  expensive  one.  Following  a  price  rise  she  could 
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shift  to  another  brand  so  that  the  brand  with  a  relatively  low  price 
would  gain.   In  neither  case  should  the  use  of \ first  differences  in  the 
estimating  model  obscure  the  picture.   The  recursion  used  to  generate  (3) 
leaves  the  actual  slope  coefficients  unchanged. 

Let  us  turn  to  the  positive  price  effect  situation.   One  explanation 
here  would  be  that  a  price  rise  generates  expectations  of  further  price 
rises  J  and  that  a  price  decline  similarly  makes  people  expect  further 
declines.   Such  an  "expectational"  theory  would  lead  to  positive 
coefficients.   Since  margarine  is  purchased  weekly  in  most  cases,  a 
reaction  to  a  price  change  nsight  be  swift  and  it  can  also  be  controlled 
by  delaying  purchases  or  by  buying  large  quantities.   In  either  case^  the 
expectations  of  future  prices  will  play  a  role  in  determining  actual 
behavior.   On  the  other  hand^  if  the  desire  is  to  take  advantage  of  a 
lower  price  —  the  "normal"  reaction  —  but  price  changes  occur  within 
the  week  the  shopper  might  not  have  time  to  take  advantage  of  the 
opportunity,  again  leading  to  positive  coefficients. 

Another  type  of  explanation  of  the  positive  coefficients  lies  with 
the  substitution  effect  from  butter.   It  might  be  that  margarine  and  butter 
prices  vary  together  —  although  the  multicollinearity  observed  earlier 
would  argue  against  it  —  and  that  as  margarine  prices  go  up,  so  do 
butter  prices  (and  vice  versa) .   In  such  a  case  people  might  switch 
from  butter  to  margarine  —  or  the  better  types  of  margarine  —  when 
prlfcea  go  up,  and  back  to  butter  when  prices  decline;  producing  the 
observed  positive  coefficients.   This  hypothesis  has  ad  mentioned  been 
advanced  in  the  past ,  but  there  is  no  real  evidence  here  that  can  be  shown 
in  support  of  it,  more  than  the  positive  coefficients  themselves. 
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It  seems  more  realistic,  perhaps,  to  look  for  an  explanation  in 
terms  of  the  brands  price  dealing  activity.   If,  for  example,  the  price 
coefficients  vary  according  to  the  degree  to  which  the  brand  indulges  in 
deal  promotions  (so  that  a  brand  sold  with  intensive  dealing  efforts  vrLll 
show  less  or  more  of  a  regular  price  sensitivity  than  a  brand  which 
rarely  employs  dealing,  say)  then  our  model  is  misspecified.  It  should 
have  accounted  for  that  fact  by,  for  example j,  introducing  a  deal  intensity 
variable  interactively  with  the  price  variables.  Such  a  deal  variable  could 
be  calculated  as  the  proportion  of  total  units  of  a  brand  purchased  on  a 
deal  basis.   Such  an  approach  would  make  the  price  coefficients  reflect 
the  interactive  effects  of  regular  price  and  dealing,  and  might  have  come 
out  with  more  consistent  and  explicable  signs. 

This  reasoning  is  of  course  based  on  hindsight  and  rationalization. 
It  is  my  guess,  however,  that  a  better  picture  of  the  sensitivity  to 
regular  price  changes  will  have  to  account  for  the  dealing  effects 
directly,  and  not  simply  exclude  them  as  done  here. 

V.   DISCUSSION 

It  seems  probable  that  the  results  obtained  in  the  analysis  of  the 
data  are  incorrect  because  of  misspecification   On  the  substantive  side, 
it  is  likely  that  a  more  explicit  accounting  for  the   dealing  effects  would 
have  led  to  a  different  price-quantity  model  and  different  impact  coefficients 
On  the  statistical  side,  the  approximation  due  to  equal  equation  disturbaiices 
for  all  individual  brands  is  probably  not  very  good  for  these  data^   As  a 
consequence  of  this  potential  misspecification,  the  analysis  cannot  be 
said  to  yield  any  conclusive  results  as  tc  the  actual  price-quantity 
relationship. 
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What  can  be  said  about  tha  usefulness  of  the  convergent  parameter 
model  in  marketing?  It  is  clear  that  the  potential  usefulness  is  very 
great  in  marketing j  but  as  this  api/lication  has  pointed  out,  it  is  as 
vulnerable  to  misspecification  as  other  econometric  techniques  5  and  a 
lot  of  care  should  go  into  its  particular  application.   It  is  more 
flexible  than  most  of  the  other  techniques  proposed  to  handle  over  time, 
across  units  variations,  and  the  algorithm  is  reasonably  fast  considering 
the  computations  involved  in  all  these  techniques.   In  marketing  we  do 
encounter  many  problems  which  lend  themselves  naturally  to  this  type  of 
analysis.   For  example,  the  analysis  of  the  varying  sensitivity  of  a 
product  to  various  marketing  actions  as  it  progresses  through  its  life 
cycle  seems  a  natural  area  of  application.   Similarly,  the  relationships 
between  various  behavioral  constructs,  such  as  attitudes,  and  subsequent 
behavior  might  be  analyzed  here;  or  the  relaitonship  between  salesmen's 
performance  and  their  incentives  as  their  time  with  the  firm  grows. 
Finally,  since  there  simply  are  a  lot  of  marketing  data  around  with 
observations  on  units  over  time,  this  type  of  algorithm  ought  to  provide 
a  good  standard  statistical  tool  in  any  large  data  bank. 
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TABLE  1 
Results  from  the  Pooled  Regression  Runs 


a.  With  one  intercept. 


Standardized 

Regression 

Coefficients 


Uns t andard i z ed 

Regression 

Coefficients 


Multiple 
Correlation 


t-ratios 


n 


^1 


^2 


.51 

-.08 

5.78      -.26       .43       4.04 

.64 

b.   With  separate  brand  intercepts. 

1.05 

-.61 

11.93      "1.95      .49       2»44 

1.45 

a-^   is  the  coefficient  for  the  first  difference  in  unit  price  paid 

a.2   is  the  coefficient  for  the  first  difference  in  the  relative  unit  price  paid 

The  dependent  variable  is  the  first  difference  in  per  capita  units  bought. 
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TABLE  3 
Price  Coefficients  for  ia  Convergent  Parameter  Model  and  a  Random  Coefficient  Model  I 


Stochastic  Specification:  Q„  =  .75;  Q^  =  .90;  Qp   =  .45;  Qp   "  .54;  Q„  ^ 

^1        ^2        ^a,        ^a         12 


.33. 


R  =  .20;  (f   =  .70;  (|   =  .90.   (Divergence  rates  - 

1         2         0.0  for  random  model) 


Brand  Code 


Convergent  Model 
^1  ^2 


-.12 


.13 


Random  Model 


"^1 
-.28 


•*2 
.07 


15 


-.95 


.11 


-.83 


.07 


16 


-.40 


.21 


-.08 


.07 


20 


1.81 


.57 


1.19 


.07 


24 

.40 

43 

-.64 

45 

.60 

48 

.32 

-.05 


-.23 


-.04 


-.30 


.07 


-.57 


.60 


.05 


.07 


.07 


.07 


-.10 


50 


.32 


.05 


.31 


-.32 


52 


-.22 


-.01 


.31 


.06 


57 


.55 


1.56 


.31 


1.42 
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TABLE  3  (Continued) 


Brand  Code 


Convergent  Model 


Random  Model 
^2 


58 


,61 


1.26 


.31 


.31 


61 


73 


88 


.93 


.33 


.27 


-.05 


.99 


-.73 


.31 
(.3^) 

.31 


.31 


-.20 
.97 
.38 


a,  is  the  coefficient  for  the  first  difference  in  unit  price  paid. 

32  is  the  coefficient  for  the  first  difference  in  the  relative  unit  price  paid, 

The  dependent  variable  is  first  difference  in  per  capita  units  bought.  All 
variables  are  standardized  using  the  means  and  standard  deviations  of  the 
respective  brands. 
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TABLE  4 
Price  Coefficients  for  a  Convergent  Parameter  Model  and  a  Random  Coefficient  Model  II 


Stochastic  Specification:  Q   =  .90;  Q   =  .95:  0„   =  .54;  Q^   =  .57;  Q     «  .35, 

^1        H  '     %        Ga2        ^1^2 


Brand  Code 


R  =  ,20;  (|   =  .95;  i   =  .90.   (Divergence  rates  = 
1         2        0.0  for  random  model) 


*1 
-.24 


Convergent  Model 
a,  a^, 


.89 


Random  Model 


*1 

-.30 


"2 

.08 


15 


16 


1.11 


-1.49 


-.53 


•  «5  i 


-.85 


-.08 


.08 
.08 


20 


3.33 


.87 


1.28 


.08 


24 


43 


45 


48 


.21 
-.89 

.74 
1.18 


-.45 


-.35 


-.29 


-.19 


.08 


-.59 
(.97) 

.64 


.05 


.08 
.08 
.08 

-.09 


50 


.33 


.52 


.32 


-.26 


52 


57 


-.41 


.36 


.39 
(2.11) 

-.27 


.32 


32 


.06 

1.38 
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TABLE  4  (continued) 

Brand  Code        Convergent  Model  Random  Model 

^1            *2  *1  ^2 

58          .76           1.55  .32  .31 

61         1.27            .52  .32  -.15 

73         .26            .88  .32  .95 

88          .25           -.86  .32  .40 


a  is  the  coefficient  for  the  first  difference  in  unit  price  paid. 

^2   is  the  coefficient  for  the  first  difference  in  the  relative  unit  price  paid. 

The  dependent  variable  is  first  difference  in  per  capita  units  bought.  All 
variables  are  standardized  using  the  means  and  standard  deviations  of  the 
respective  brands. 


t   . 


EXHIBIT  1 


The  Price-'^ia.ntlty  Modeling 

(1)   q^^  ==  a^  -I-  a^P^^  +  a^B^^^   /  P.,  f  a3P^,  /  Pg,  -!-  e^,,  t.^^l,  ...,  T 


rji  o   -  a     ~  a  (P  -P    ,)  +  a,(P  /P  „-?    ,)  + 
^^'     'nt   ^n,t-l    1  at  n,t-l'    2^  nt   -t  n,c-l 

e  ^  -  e    ,  , 

nt    n,t~l 


-I  ■  1 


I  ~i ,.  // 


1 


EXHIBIT  2 
The  Convergent  Parameter  Model 
A.   Basic  Equation: 

where  E(U^J  -  0,  E(U^^U^^)  =  S^/iS^^   +  V 
In  vector  notation: 

nt     nt  nt'  I  aj.<.f    nt    nt  nt    nt 

where  W»  Z,  C,  and  a  are  now  vectors  for  individual  n  at 
time  t. 


B.   Parameter  Transitions  (in  vector  notation) : 

(5)   C^^i  =  C^  +  Yt    fc-1,  ...,  T-1;  '■    '■ 

^^^   "n,t+l  '\'-\    Kt-\^   "■   \t»  ^=^'  •••'  '^-^'  "=^»  ••-  N' 


where  E(Y.  )  =  0;  E(Y  vh  =  <S„^CJ^Q 


and   E(U^jJ)  =  0;  UV^^^l^)   =  0 
.E<^s^it>  =  ^st<^\a- 


S      (1 


t.'i 


■■!"r',~ 


C    -    t. 


EXHIBIT  3 
Basic  Estimation  Procedure 

A.  Parame' er  Extrapolation 

<"  Mt-i'*Vi|t-i 

(8)  M^|,.i  =  *Mt.l|t-l*  +  <' 

B.  Forecasting 

(9)  e^  =  Y^  -X^_  ,,|,.i 


(10)  F,  =X^M  .^_^xl+R 


(11)  L,  =M,,,_iXl 


C.   Revision 

(12)  u  I   =  U,  K  ,  +  L.  F~^  e^. 

t  t   "^t  t-i    t  t   ^ 


(13)  M  ,   =  M  ,    -  L  ?-l  L 

t  t    t  t-1    t   t   t 


EXHIBIT  4 
Matrix  Specifications  Used  In  Estimation 


Parameter 
Estimate: 

•^tlt-l  = 


^1 


^N 


where  C,  a^  1=1,  ...,  N  are  coefficient  vectors 


Jt  t-1 


The  Q- 

matrlx 

Q  = 


The  R- 
matrix: 

R  = 


^c 
Qca 


Qa  +  Q( 


'ca 


V\  Rg 


R 


G    ^2  "*"  ^ 


\ 


'ca 


Qr.   •  •   Qr 


%   '  % 


»     9     •     •     9     • 


•   R, 


%  +  Rg 


,  where  the  Q's  are  defined 
as  in  Exhibit  2 


,  where  the  R' s  are  defined 
as  in  Exhibit  2 


The 

transition 

matrix: 


*  N 
N 


N 


I-Ari>    I-A. 


N 


0 

I-A 


N 


i 


'fi.  H-  ^-\ 


N 


,  where  the  ^a^   comprise  tha 
transition  rates 


